
Inria | Raweb 2014 | Presentation of the Project-Team CQFD



Section: New Results

Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities

Participants: Francois Dufour, Tomas Prieto-Rumeau.

We consider a discrete-time Markov decision process with Borel state and action spaces and a possibly unbounded cost function. We assume that the Markov transition kernel is absolutely continuous with respect to some probability measure $μ$. Replacing this probability measure with its empirical distribution $μ_{n}$ for a sample of size $n$ yields a finite state space control problem, which is used to approximate the optimal value and an optimal policy of the original control model. We impose Lipschitz continuity properties on the control model and its associated density functions. The accuracy of the approximation of the optimal value and of an optimal policy is measured by means of a non-asymptotic concentration inequality based on the 1-Wasserstein distance between $μ$ and $μ_{n}$. We discuss how to solve the approximating control model numerically and present an application to an inventory management problem. This work has been published in Stochastics: An International Journal of Probability and Stochastic Processes [26].
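
To make the construction concrete, the following Python sketch builds a finite-state model from a sample of $μ$ and solves it for the average cost criterion. It is a toy illustration under stated assumptions, not the method or the inventory example of [26]: the state space $[0,1]$, the choice of $μ$ as the uniform distribution, the functions `density` and `cost`, the three-point action set, the row normalization of the sampled kernel, and the use of relative value iteration are all hypothetical choices made here for illustration.

```python
import math
import random


def density(y, x, a):
    """Hypothetical (unnormalized) transition density q(y | x, a) w.r.t. mu.

    Assumption: mu is Uniform[0, 1]; the density is Lipschitz in (x, a, y).
    """
    centre = min(1.0, max(0.0, 0.5 * x + a))
    return math.exp(-abs(y - centre) / 0.2)


def cost(x, a):
    """Hypothetical Lipschitz one-stage cost."""
    return abs(x - 0.5) + 0.1 * a


def build_finite_model(n, actions, rng):
    """Replace mu by the empirical distribution of n sampled points.

    The approximating model lives on the sampled points; the weight of
    point y_j from (x_i, a) is proportional to q(y_j | x_i, a).
    Assumption: rows are normalized so that they sum to one.
    """
    states = sorted(rng.random() for _ in range(n))
    P = {}
    for i, x in enumerate(states):
        for a in actions:
            weights = [density(y, x, a) for y in states]
            total = sum(weights)
            P[i, a] = [w / total for w in weights]
    return states, P


def q_value(i, a, states, P, h):
    """One-stage cost plus expected relative value of the next state."""
    return cost(states[i], a) + sum(p * hj for p, hj in zip(P[i, a], h))


def relative_value_iteration(states, actions, P, iters=200):
    """Approximate the optimal average cost of the finite model.

    Returns the gain estimate, the relative value function h (with
    h[0] = 0 as reference), and a policy that is greedy w.r.t. h.
    """
    n = len(states)
    h = [0.0] * n
    gain = 0.0
    for _ in range(iters):
        new_h = [min(q_value(i, a, states, P, h) for a in actions)
                 for i in range(n)]
        gain = new_h[0]                   # value at the reference state
        h = [v - gain for v in new_h]     # renormalize so that h[0] = 0
    policy = [min(actions, key=lambda a, i=i: q_value(i, a, states, P, h))
              for i in range(n)]
    return gain, h, policy


if __name__ == "__main__":
    rng = random.Random(0)
    actions = [0.0, 0.25, 0.5]
    for n in (10, 20, 40):
        states, P = build_finite_model(n, actions, rng)
        gain, _, _ = relative_value_iteration(states, actions, P)
        print(n, round(gain, 4))
```

Running the script prints the estimated optimal average cost for a few sample sizes. The estimates vary from sample to sample because the finite model depends on the random draw; the sketch does not compute the 1-Wasserstein distance or any error bound.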
